Content-Based Table Retrieval for Web Queries
نویسندگان
چکیده
Understanding the connections between unstructured text and semi-structured table is an important yet neglected problem in natural language processing. In this work, we focus on content-based table retrieval. Given a query, the task is to find the most relevant table from a collection of tables. Further progress towards improving this area requires powerful models of semantic matching and richer training and evaluation resources. To remedy this, we present a ranking based approach, and implement both carefully designed features and neural network architectures to measure the relevance between a query and the content of a table. Furthermore, we release an open-domain dataset that includes 21,113 web queries for 273,816 tables. We conduct comprehensive experiments on both real world and synthetic datasets. Results verify the effectiveness of our approach and present the challenges for this task.
منابع مشابه
Investigating the Impact of Authors’ Rank in Bibliographic Networks on Expertise Retrieval
Background and Aim: this research investigates the impact of authors’ rank in Bibliographic networks on document-centered model of Expertise Retrieval. Its purpose is to find out what kind of authors’ ranking in bibliographic networks can improve the performance of document-centered model. Methodology: Current research is an experimental one. To operationalize research goals, a new test colle...
متن کاملAn analysis of failed queries for web image retrieval
This paper examines a large number of failed queries submitted to a web image search engine, including real users’ search terms and written requests. The results show that failed image queries have a much higher specificity than successful queries because users often employ various refined types to specify their queries. The study explores the refined types further, and finds that failed querie...
متن کاملA Visual Ontology Query Interface for Content- Based Image Retrieval
Various querying techniques have been developed for content-based image retrieval. We propose a Visual Ontology Query Interface for querying an OWL ontology built using content-based image retrieval techniques. With the query interface, users are able to formulate various ontology queries without having to know SPARQL, an ontology query language proposed by The World Wide Web Consortium.
متن کاملBayesian Semantics Incorporation to Web Content for Natural Language Information Retrieval
For the present work, we endeavor with the important aspect of information retrieval of Web content using natural language queries. Currently, markup languages and formalisms do not fully provide mechanisms for effective and accurate analysis of Web content but rather provide means for describing the content in a more human-centric approach. As a result, natural language queries cannot be handl...
متن کاملContent-Based Image Retrieval over the Web Using Query by Sketch and Relevance Feedback
This paper investigates the combined use of query by sketch and relevance feedback as techniques to ease user interaction and improve retrieval effectiveness in content-based image retrieval over the World Wide Web. To substantiate our ideas we implemented DrawSearch, a prototype image retrieval by content system that uses color, shape and texture to index and retrieve images. The system avails...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1706.02427 شماره
صفحات -
تاریخ انتشار 2017